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Abstract 

Background: Recent evidence suggests that less than one-quarter of patients with symptomatic nosocomial Clostridium 
difficile infections (CDI) are linked to other in-patients. However, this evidence was limited to one geographic area. We 
aimed to investigate the level of symptomatic CDI transmission in hospitals located across England from 2008 to 2012. 

Methods: A generalized additive mixed-effects Poisson model was fitted to English hospital-surveillance data. After 
adjusting for seasonal fluctuations and between-hospital variation in reported CDI over time, possible clustering 
(transmission between symptomatic in-patients) of CDI cases was identified. We hypothesised that a temporal proximity 
would be reflected in the degree of correlation between in-hospital CDI cases per week. This correlation was modelled 
through a latent autoregressive structure of order 1 {AR(1)). 

Findings: Forty-six hospitals (33 general, seven specialist, and six teaching hospitals) located in all English regions met our 
criteria. In total, 12,717 CDI cases were identified; seventy-five per cent of these occurred >48 hours after admission. There 
were slight increases in reports during winter months. We found a low, but statistically significant, correlation between 
successive weekly CDI case incidences (phi = 0.029, 95%CI: 0.009-0.049). This correlation was five times stronger in a 
subgroup analysis restricted to teaching hospitals (phi = 0.1 04, 95%CI: 0.048-0.159). 

Conclusions: The results suggest that symptomatic patient-to-patient transmission has been a source of CDI-acquisition in 
English hospitals in recent years, and that this might be a more important transmission route in teaching hospitals. 
Nonetheless, the weak correlation indicates that, in line with recent evidence, symptomatic cases might not be the primary 
source of nosocomial CDI in England. 
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Introduction 

Clostridium difficile infection (CDI) is a source of considerable 
morbidity and mortality for hospitalised patients, and its preven- 
tion, control and treatment place a substantial burden on 
healthcare systems[l,2]. Since 2007, in addition to improved 
antimicrobial stewardship and mandatory surveillance, enhanced 
infection control measures to prevent C. difficile transmission have 
been implemented in England. These measures have focused on 
isolating symptomatic patients and improving hospital-cleaning 
regimens, with the goal of meeting government-led CDI reduction 
targets. Reported cases of CDI have dropped from 55,498 in 
2007/08 to 18,005 in 201 1/12[3], at a time when the prevalence 
of the hyper-virulent C. difficile BI/NAPl/027 also decreased[4]. 
Apart from improved antimicrobial stewardship, guidelines for 
CDI prevention and control assume that symptomatic patients in 



hospitals account for most C. difficile transmission and consequent 
infection (CDI). However, in 2012 and 2013, research using whole 
genome sequencing of hospital and community isolates from 
Oxfordshire, United Kingdom, has challenged this assumption. 
Eyre et al found a high level of genomic diversity in samples from 
symptomatic CDI patients. Moreover, only a minority of hospital- 
onset cases of CDI were found to share an epidemiological link as 
well as genomic link with a symptomatic CDI case [5-7] . 

This recent evidence was limited to a small sample of hospitals 
that were all located in one English county. To explore whether 
these new developments in our understanding of the epidemiology 
of CDI are more generally applicable, we investigated the 
presence of clustering in symptomatic CDI patients, indicative of 
patient-to-patient C. difficile transmission, in a wide range of 
hospitals in England between 2008 and 2012. 
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Methods 

Data 

The dataset consisted of mandatory reported details of each 
identified GDI case >2 years of age collected from all 167 
National Health Service (NHS) Trusts via a web-enabled 
surveillance system, held by Public Health England (PHE)[8]. 
Details included the dates of admission and faecal sampling, 
patient category (e.g. inpatient, outpatient etc.) and age. Data 
covering the period between April 2008 and March 2012 were 
extracted from this surveillance scheme. To ensure consistency in 
the reported observations, we restricted our analyses to NHS acute 
trusts that followed the Department of Health's GDI testing 
guidance according to a survey held in 20 10 [9]. In England, a two 
test screening algorithm has been advocated and hospital trusts are 
recommended to test patients with diarrhoea (Bristol Stool Chart 
types 5-7) [10] using either a GDH Enzyme Immunoassay (GDH 
EIA), a nucleic acid amplification test (NAAT) or the Polymerase 
Ghain Reaction (PGR), followed by a toxin sensitive EIA (or a cell 
cytotoxin neutrahsation assay). If both the first test and the second 
test are positive, the case is eligible for reporting to PHE[1 1]. This 
resulted in the selection of data from 46 hospitals, belonging to 28 
individual NHS acute Trusts, and excluded any of the Oxfordshire 
hospitals (see table 1). Only GDI positive in-patients were included 
for analysis (i.e. excluding regular attendees, outpatients and 
patients having visited only accident and emergency departments). 
In order to evaluate healthcare facility associated infections, 
patients with onset of symptoms <48 hours after admission were 
excluded[12]. We aggregated the reported data per hospital by 
week, using the date of faecal sampling as the time of onset of GDI 
related symptoms. 

Statistical methods 

A generalized additive mixed-effects Poisson model, allowing for 
overdispersion, with a log link[13,14], was used for the weekly 
observations of GDI counts. Three effects were identified that 
required inclusion in the linear predictor of this model. Firstly, 

Table 1. Description of CDI data from 46 selected hospitals. 



hospital was introduced as a categorical variable to allow for 
potentially strong clustering due to differences in size, case-mix, 
and region [see table 1). Secondly, a fixed polynomial-by-hospital 
interaction term was included to accommodate varying rates of 
change (primarily decline) over the four-year period in observed 
GDI per hospital; Figure 1 shows the time series of symptomatic 
GDI per hospital. Thirdly, a cyclic effect was included using a 
periodic penalised cubic regression spline to accommodate 
seasonal patterns of GDI, as have been observed previously in 
settings outside England, and which have been attributed to 
increased levels of "at risk" antibiotic use (e.g. ciprofloxacin) 
during the winter months (January to March), and influenza 
(which can lead to secondary bacterial infections requiring 
antibiotic treatment) [15- 17]. The intention was that these three 
terms would account for the longitudinal behaviour of weekly GDI 
counts. Finally, a random error term was added to the linear 
predictor with an autoregressive correlation structure of order 1 
(AR(1)) that would accommodate local (in time) departures from 
this base model. The autoregressive component of this error would 
be an indicator of local statistical dependence, and its presence 
would serve as a proposed marker for transmission between 
symptomatic cases (either directly, or indirectiy via the hands of 
healthcare works or hospital surfaces contaminated by symptom- 
atic cases). Full details of the model are provided in the material 
SI. 

AU analyses were performed with R 3.0.1 (Team R Develop- 
ment Gore, website: http://cran.r-project.org/) using the R 
package mg(;»[18] and splines. To account more accurately for the 
decline in observed CDI since 2007, the comparative fit of three 
polynomials, linear, quadratic and cubic was assessed using Akaike 
Information Criterion (AIC). We added a cyclic (periodic) 
penalised cubic regression spline over the variable week of the 
year and compared model fit with and without this smoothing 
term representing seasonal variations, again based on AIC (see 
table ST). The standardized residuals were examined for significant 
departures from normality [19]. In addition, the Box-Pierce 
portmanteau statistic was used to indicate serial dependence. 
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Summary statistics of CDI cases reported to the English mandatory surveillance system by a selection of 46 hospitals from the period of April 2008 to March 2012. 
IQR = Interquartile range; p.w. = per week. 
doi:l 0.1 371/journal.pone.0099860.t001 
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Figure 1. Observed weekly number of CDI per hospital over the four-year study period. Grey dots represent the weekly-observed CDI 
cases within all hospitals from April 2008 to IVlarch 201 2. X-axis: Week 0 corresponds to the first week of April 2008 and week 209 to the last week of 
IVlarch 2012. Red line: the incidence trend over time illustrated by cubic smoothing spline fit (for illustration). 
doi:1 0.1 371 /journal.pone.0099860.g001 



Results 

Descriptive statistics 

The 46 hospitals reported 12,717 CDI cases in the four-year 
study period, of which 9,574 (75.3%) had an onset >48 hours after 
admission. Between 2008/09 and 2009/10 there was a 30.6% 
decline in CDI reported from these healthcare facilities, in 
comparison to 20.9% (2009/10 to 2010/11) and 15.7% (2010/ 
1 1 to 20 1 1 / 1 2) in the years thereafter. This is in line with national 
figures (29.1%, 18.0% and 17.1% respectively). Teaching hospitals 
reported the highest number of cases, which did not change once 
adjusted for their larger hospital size (expressed in the median 
number of cases per 10,000 bed-days available, where available 
bed-days is a crude estimate of the number of hospital beds in 
20 13 [20] multiplied by the number of days covered by the study, 
see table 1). 

Base-model assuming no transmission patterns 

For all three representations of the base-model, a model 
including seasonal patterns provided a moderately better fit, and a 
combination of seasonality and a cubic time trend proved the best 
model fit (see table SI). By examining the correlogram of the final 
base-model's normalized residuals, we could identify whether 
there was evidence of serial dependence {see figure 2A). Such 
dependence could be explained by transmission between symp- 
tomatic CDI carriers. Figure 2A illustrates a low but significant 
correlation between cases in a given week and symptomatic 
carriers present in the hospital one and two weeks earlier (p<0.05), 
with a slightly stronger correlation at two weeks. Taking a total of 
a 20-week interval (as transmission events between hospital cases 
with an onset further than 20 weeks apart is assumed to be 
rmlikely), the model revealed a highly significant Box-Pierce Q; 
statistic PC^ = 54.59, degrees of freedom (dl) = 20, p = 0.00005), 
indicating non-independence. Therefore, the AR(l)-model was 



fitted, with the best fitting cubic polynomial to represent the 
decline in CDI over time, as well as seasonality. 

AR(1)-model assuming transmission patterns 

Figure SI presents the seasonal variation of CDI in hospitals 
within our sample fitted by the AR(l)-model, and shows a slight 
increase during the months January to March. Assuming that 
symptomatic cases primarily caused acquisition among patients 
admitted to hospital up to one week later and, to a lesser extent, to 
cases admitted beyond this time (i.e. the AR-1 structure), the 
estimated magnitude of dependence was low, but statistically 
significant (<I> = 0.029 (95% CI = 0.009-0.049). This suggested that 
transmission between symptomatic CDI cases was affecting the 
weekly-observed CDI, but that its role in acquisition might be 
limited. This transmission pattern between observed weekly CDI 
was not fuUy explained by the AR(1) structure, as is indicated by 
the significant correlation at lag(week) 2 still being present after 
having fitted the AR(1) covariance structure (Box- Pierce QjStatistic 
(X^ = 44.4, df = 20, p-value = 0.001)) (see figure 3A). 

AR(1)-model by hospital type 

The negative correlation presented in the AR(1) cubic model 
after week 20 [see figure 3A) implied that the model might be over- 
fitting our data. Also, diagnostic plots suggested deviation from 
normality in the model's standardized residuals (see figure S2A). 
This can be explained by the large variability in the number of 
reported cases per hospital, with a much greater number of reports 
and related rate of change in reports over time from teaching 
hospitals compared to just a few cases from specialist and some 
general hospitals. As a consequence, a hospital-specific term in the 
model representing the change in CDI reports over time might not 
be suitable for hospitals with only a few cases reported, whereas 
such specification is required to represent the CDI trend in 
teaching hospitals. Fitting the model to the more homogeneous 
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Figure 2.Dependence between observed weekly number of CDI - base-model. A and B: autocorrelation function (ACF) of normalized 
residuals of the base-model fitted to data of all hospitals (A) and of teaching hospitals only (B) including a fitted cubic representation of the CDI trend 
over time and seasonality. The blue lines correspond to the threshold for significance of correlation (dependence) (p<0.05) between lagged weekly 
observations up to week 20. E.g. crossing of this threshold by the base-model residuals at lag 1 and lag 2 for the model fitted to all hospitals suggests 
that a correlation exists between the observed CDI in a given week and the number of CDI cases present in the hospital one and two weeks earlier. 
doi:1 0.1 371 /journal.pone.0099860.g002 



group of teaching hospitals only, revealed a stronger, but still 
relatively low statistically significant correlation between CDI cases 
and patients present in the hospital one week later (<I) = 0.104 
(95% CI = 0.048-0. 159) {see figure 2B), which was captured by the 
AR(l)-structure (Box-Pierce Q;Statistic =23.2, df=20, 



p = 0.281) (see figure 3B). Figure 4 illustrates the cubic AR(1) model 
predictions in comparison to the observed teaching hospital data 
and figure S2 the model diagnostics. 
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Figure 3. Dependence between observed weekly number of CDI - AR1 -model. A and B: autocorrelation function (ACF) of normalized 
residuals of the AR1 -model fitted to data of all hospitals (A) and of teaching hospitals only (B) including a fitted cubic representation of the CDI trend 
over time and seasonality. As in figure 2, the blue lines correspond to the threshold for significance of correlation (dependence) (p<0.05) between 
lagged weekly observations up to week 20. Crossing of this threshold by the AR1 -model residuals at lag 2 suggests the AR1 structure (symptomatic 
cases primarily cause acquisition of C. difficile among patients admitted to hospital up to one week later and, to a lesser extent, to cases admitted 
beyond this time), does not fully explain the dependence structure between weekly observations. 
doi:1 0.1 371/journal.pone.0099860.g003 
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Figure 4. AR(1) model fit teacKiing hospitals. Grey dots represent the weekly-observed CDI cases within the teaching hospitals from April 2008 
to March 2012. X-axis: Week 0 corresponds to the first week of April 2008 and week 209 to the last week of March 2012. Blue line: fit of the AR(1) 
model with a cubic representation of the rate of change of CDI over time and seasonality. 
doi:1 0.1 371 /journal.pone.0099860.g004 



Discussion 

In this study we explored the significance of symptomatic 
patient-to-patient CDI transmission in Englisli hospitals as a 
source of hospital onset CDI. We found a statistical significant 
signal of dependence between symptomatic CDI patients spending 
time in hospital close in time, which suggested symptomatic 
patient-to-patient transmission of CDI was present. Nonetheless, 
the low magnitude of correlation between weekly cases in the 
AR(1) model, implies that the role of symptomatic carriers in CDI- 
acquisition was not as important as previously supposed. The 
highest number of CDI cases was reported in teaching hospitals, 
which corresponded to their overall prevalence of hospital- 
acquired infections (HAI) being among the highest according to 
the English National Point Prevalence Survey on HAI [8]. This 
could be attributed to the more vulnerable case-mix of such 
hospitals, whom might be more prone to acquiring CDI[21]. 
Taking CDI reports from the teaching hospitals only, the 
association between symptomatic carriers was somewhat stronger, 
but still relatively low. Our findings are in line with recent 
evidence from whole genome sequencing of 1223 isolates from 
healthcare (among others from two large acute teaching hospitals, 
one specialist and one general district hospital) and community 
onset CDI cases in Oxfordshire, England isolated from 2008 to 
2011 [7]. Less than 20% of the genetically linked CDI positive 
cases had documented hospital contact with a symptomatic 
patient [7]. In addition, 45% of the included CDI cases could 
not be related to any other symptomatic case (community or 
healthcare setting) as they were too genetically diverse [7]. Even 
considering the reported low sensitivity of the toxin EIA test[12] 
used for CDI identification in the referenced study, the diversity 
argues for alternative sources of many CDI cases. 

Improved infection control, with a primary focus on preventing 
transmission, such as hand hygiene, isolation of symptomatic cases, 
and environmental cleaning, might result in lower rates of 
successful transmission between symptomatic cases following 
contact[22]. In addition, once a patient comes in contact with C. 
difficile or its spores, the development of CDI is dependent on the 
disruption of the normal gut flora, primarily due to antibiotic use 
such as broad-spectrum cephalosporins and quinolones[23,24]. 
Nonetheless, this would not explain the origin of symptomatic 
patients lacking a shared spatial-temporal and/or genetic link. C. 
difficile has been recovered from hospital rooms occupied by both 
symptomatic and asymptomatic carriers [25-2 7] and its spores can 
persist in the environment for as long as 20 weeks [28]. Therefore, 
transmission from contaminated hospital surfaces could suggest 
symptomatic hospital cases are unrelated, whereas actually 



indirect-cross infection could have occurred. However, a genetic 
link would still be found if cases had acquired their infection from 
the same contaminated hospital surface. If no restrictions were 
apphed to the infectious period, incubation period or length of 
ward contamination, 27% of the sequenced samples in the earlier 
mentioned study shared both a genetic and an epidemiological 
hospital contact [7]. Alternatively, asymptomatic carriers could 
contaminate hospital surfaces with lower intensity than symptom- 
atic carriers, hence cause acquisition at low frequency, which 
could potentially explain the wide genomic diversity among cases 
[29]. 

Importation of symptomatic and asymptomatic carriers from 
community-settings such as long-term care facilities (LTCF) has 
also been suggested as a source for hospital-onset CDI[30,31]. A 
population-based study conducted in the United States showed 
that out of a total of 416 identified CDI cases, 41% had onset of 
symptoms in the community or within 48 hours after admission 
and no hospitalisation in the 12 weeks prior to onset[32]. We 
excluded patients with an onset of CDI <48 hours into admission. 
This is a frequently used, but arbitrary, cut-off to define 
community-acquired HAI. Hence it is possible that our data 
included asymptomatic patients who acquired the bacteria 
elsewhere, and developed symptoms in the hospital >48 hours 
after admission. Moreover, in addition to onset within 48 hours 
into admission, no hospitalisation in the past 1 2 weeks is an often- 
used additional requirement for community-acquired CDI [11]. 
As we did not have information on previous hospitalisation, the 
possibility exists that cases defined as community-acquired in our 
data, and were therefore excluded, actually were hospital-acquired 
cases, i.e. patients who acquired C. difficile in their previous stay, 
but started to develop symptoms after discharge and were re- 
admitted with symptomatic CDI. Furthermore, approximately 
20% of cases with a first occurrence of CDI experience recurrence 
after discontinuation of treatment[33,34]. Re-admitted CDI 
carriers, who resolved their symptoms but remained colonised 
resulting in a recurrent episode once e.g. put on at risk antibiotics, 
could be partly responsible for the low correlation between 
symptomatic carriers. However, considering the known chances of 
relapse, we do not expect these can be primarily responsible for the 
results of this study. 

Finally, and although not our primary focus, we found evidence 
of seasonal variations in CDI incidence in our selection of English 
hospitals, with slightly elevated reports of hospital-associated 
symptomatic CDI in the winter months. Seasonality has been 
suggested in relation to increased levels of CDI related antibiotics 
during the winter months in settings outside of England [15- 17]. 
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Comparison of variability in antibiotic prescribing patterns within 
English hospitals with fluctuations in hospital reported GDI 
incidence would be an interesting area of investigation. Nonethe- 
less, the seasonal component in our model only explained a small 
proportion of the behaviour of the weekly reported GDI (reflected 
by a moderate decrease of AIC), and we would like to urge for 
more research on the presence of seasonal patterns of GDI in 
England. 

This study had several limitations. Firsdy, we selected weekly 
intervals for our analysis. Both the incubation time and infectious 
period of C. difficile have not been quantified with certainty. Studies 
have suggested that person-to-person transmission occurs primar- 
ily within a week (ranging from a median of 1, 4 or 8 days after 
GDI diagnosis) [5], whereas a median incubation time of 2-3 
days [12] to 18-33 days has been proposedp]. Hence, onset of 
symptoms following symptomatic patient-to-patient transmission 
might occur after the one-week time interval, which could have 
affected the strength of correlation between weekly incidences. 
Secondly, strains may vary in their pathogenicity [35] and 
transmissibility[6]. The routinely collected surveillance data did 
not contain ribotype specific information, so we could not establish 
to what extent our results are strain-specific as well as whether the 
hospitals in our sample are representative with regards to strain 
prevalence. Moreover, the ARl structure was unable to fuUy 
explain the correlation between weekly cases close in time using 
data from all hospital types, whereas it could for the teaching 
hospitals only. This might be a consequence of the stochastic 
nature of the few GDI cases reported by the smaller hospitals 
included in the overall dataset. Alternatively, teaching hospitals 
might have better environmental cleaning practices in place and/ 
or are more likely to change antibiotic prescribing practices 
following an outbreak, resulting in more rapid containment. 
Further research is needed to clarify the observed heterogeneity in 
reported hospital-acquired infection rates and transmission 
between teaching and non-teaching hospitals. Finally, alternative 
causes of dependence of the weekly GDI observations cannot be 
ruled out, e.g. a Scottish study[36] identified a temporal 
correlation between antibiotic use and HA-GDI [36]. However, 
as the results of the referenced study[36] suggest, it is unlikely that 
antibiotic hospital consumption will fluctuate between weekly time 
intervals. After investigation of the association between monthly 
variations in antibiotic use* and monthly \'ariations in observed 
GDI, Vernaz and colleagues (2009) identified that, for almost all of 
the antibiotics investigated, the association with observed GDI was 
significant with a lag of several months (among others ciproflox- 
acin, fluoroquinolones and cefuroxime) [36]. In addition, cases 
arising from asymptomatic carriers or environmental sources 
might correlate in space and time as well. However, the level of 
onward transmission from asymptomatically colonised individuals 
is highly uncertain, nor has foodbome transmission of C. difficik to 
humans been established with certainty [37,38]. Given the 
infectious nature of symptomatic C. difficile cases, especially in 
settings with high antibiotic use, we expect symptomatic patient- 
to-patient transmission to be the most conservative explanation. 

Despite the limited information present in routinely collected 
hospital infection data, this study has provided further insight in 
the hospital transmission dynamics of C. difficile. Our results 
indicate that patient-to-patient transmission when only those 
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patients with symptomatic GDI are considered, may account for a 
small number of transmission events. To improve our under- 
standing of the epidemiology of GDI, the role of other patient 

groups should be considered, such as those in the community and 
asymptomatic carriers, as well as the importance of indirect 
transmission from contaminated surfaces in the hospital environ- 
ment and the role of antibiotic use. Individual-level patient data, 
which can inform dynamic transmission models would certainly 
aid the investigating and quantification of the potential sources of 
GDI transmission[39-41] and wiU be another area of our further 
investigations. 

Supporting Information 

Figure SI Seasonal variations of symptomatic C. diffi- 
cile infection with onset >48 hours after admission. 

Fitted cyclic penalised cubic regression spline (representing 
seasonal variations) for the cubic AR(1) model fitted to data of 
all hospitals (A) and data of the teaching hospitals only (B). 
(TIF) 

Figure S2 Diagnostic plots cubic AR(l)-models. A: 

Residual diagnostics of AR(l)-model fitted to data of all hospital, 
including a fitted cubic representation of GDI behaviour over time 
and seasonality. A: Quantile-Qtiantile (Q,-Q) plot, deviation from a 
straight line denotes deviation from normal distribution, B: 
residuals plotted against linear predictor; G: frequency distribution 
of the model residuals; D: data against fitted values. E-H: Residual 
diagnostics of AR(l)-model fitted to data of teaching hospital only. 
(TIF) 

Material SI Model description and fitting procedure. 

Further description of the model used, and model fitting and 

selection procedure. 

(DOGX) 

Table SI Comparison of model fit for the base-model 
including alternative representations of the GDI inci- 
dence trend. AIG = Akaike Information criterion, the lo\\ er the 
value, the better the fit. Upper half of table: model fit to data of all 
hospitals. Lower half: model fit to data from teaching hospitals. 
Left half of table: model fit assuming a Poisson distribution. Right 
half model fit assuming an overdispersed Poisson (quasi-Poisson) 
distribution. 
(DOGX) 
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